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The history of human populations in Africa is complex and 
includes various demographic events that influenced patterns 
of genetic variation across the continent. Through genetic 
studies of modern-day, and most recently, ancient African 
genetic variation, it became evident that deep African history is 
captured by the relationships among hunter-gatherers. 
Furthermore, it was shown that agriculture had a large influence 
on the distribution of current-day Africans. These later 
population movements changed the demographic face of the 
continent and descendants of farming groups today form the 
majority populations across Africa. Ancient DNA methods are 
continually evolving, and we see evidence of this in how 
research has advanced in the last decade. With the increased 
availability of full genomic data from diverse sets of modern- 
day and prehistoric Africans we now have more power to infer 
human demography. Future ancient DNA research promises to 
reveal more detailed stories of human prehistory in Africa. 
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Investigating human history using ancient 
DNA 

Rapid developments in the ancient DNA (aDNA) field 
extended the power of genetics to make inferences about 
the history of our species [1]. Studying aDNA offers a 
unique opportunity to access genetic variation of past 
populations and enables us to put past populations in 
context of present day genetic variation. Human groups 
from all regions of the world, have unique histories of 
migration, admixture, and adaptation, which have shaped 
their past. Fora long time, human prehistory on the African 
continent has been actively investigated by the disciplines 
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of archaeology, paleoanthropology and linguistics [2-5]. 
Although these fields have greatly contributed to our 
understanding of our species deep history on the African 
continent as well as later population migrations in Africa, 
key questions remain unanswered. For instance, whether 
changes in material culture observed in the archaeological 
record are the result of the physical migration of people, or 
alternatively, due to the spread of ideas. In this context, 
aDNA studies are ideal to study migration history by 
directly accessing the genetic variation of pre-historic indi- 
viduals, and comparing this variation to other pre-historic 
individuals from different temporal and geographic con- 
texts and also to modern-day populations. However, due to 
the difficulty in extracting DNA from human samples from 
warm climates and the initial focus on European aDNA 
studies, genome-wide aDNA studies in Africa presently 
number only few (‘Table 1) [6,7,8°°,9°°,10,11°°,12°°, 13,14]. 
Only nine ancient genome-wide studies of African individ- 
uals have been published thus far: four for North Africa and 
five for sub-Saharan Africa (‘lable 1). Here, we examined 
the results of aDNA studies from continental Africa pub- 
lished to date, in the context of modern-day genetic varia- 
tion (Figure 1). 


Ancient DNA in North Africa 

In North Africa, modern-day groups are largely related to 
Eurasian and Middle Eastern populations, with very low 
levels of genetic contributions from sub-Saharan Africa 
(blue dots, Figure 1) [15,16]. A debate existed whether 
this was the result of Paleolithic back-to-Africa migra- 
tions, or migrations connected to the introduction of 
farming practices to North Africa during the Neolithic 
[15,17]. A study on 15 kya old remains from Morocco 
(blue diamonds, Figure 1) demonstrated that Northern 
Africa received significant amounts of gene-flow from 
Eurasia predating the Holocene and development of 
farming practices [11°°]. aDNA studies further found that 
Early Neolithic North Africans (~7 kya) trace their ances- 
try to these Paleolithic North African groups [12°°], while 
Late Neolithic groups (~5 kya) contained an Iberian 
component, indicating ¢vans-Gibraltar gene-flow. These 
two different signals in Early and Late Neolithic individ- 
uals, indicate that the spread of farming practices in North 
Africa involved both the movement of ideas and people. 


‘The Sahara poses a geographic barrier to human migra- 
tion, aside from intermittent greening periods [18]. Stud- 
ies on modern-day populations [15,16] and aDNA from 
Egyptian mummies [13] indicated that gene-flow from 
the south, across the Sahara into modern-day North 
Africans, were low and appeared in recent times. More 
ancient individuals from Morocco, however, seem to have 
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Table 1 


Summary of genome-wide African ancient DNA studies published to date 


Ancient DNA in Africa Vicente and Schlebusch 9 


Study Region Current day Date range Context* N Capture/Full Coverage 
African counties Cal BP/BP genomes 
Schuenemann et a/., Northeast Africa Egypt 2726 - Pre-Ptolemaic 3 Capture 132 084 to 
2017 [13] 1952 calBP Period, 508 360 SNPs 
Ptolemaic 
Period 
Rodriguez-Varela Northwest Africa Canary Islands 951-1421 - 5 Full genomes 0.21 to 3.93x 
et al., 2017 [14] BP 
Van Loosedrecht Northwest Africa Morocco 15,100 - 13,900 LSA/ 5 Capture 343,764 to 1,013,869 
et al., 2018 [11°°] calBP Paleolithic SNPs 
Fregel et al., Northwest Africa Morocco 7,275 - 5,600 Early Neolithic 13 Full genomes and Full gen: 0.01 to 0.74x 
2018 [12°] calBP Late Neolithic capture Capt: 0.04 to 1.72x 
Gallego Llorente East Africa Ethiopia 4500 calBP LSA 1 Full genome 12.5x 
et al., 2015 [6] 
Skoglund et al., East and southern South Africa, 8173-322 calBP LSA 12 Capture Full gen: 0.7 to 2.0x 
2017 [7] Africa Tanzania, Malawi PN 3 Full genomes Capt: 9,355 to 
IA 845,016 SNPs 
Schlebusch et al., Southern Africa South Africa 2296 — 282 calBP LSA 7 Full genomes 0.01 to 13.2x 
2017 [10] IA 
Prendergast et al., East Africa Kenya, 4080—160 calBP LSA 41 Capture 0.01-3.9x 
2019 [8°°] Tanzania PN 
PIA 
IA 
Lipson et al., West Africa Cameroon 7,920—2,970 Stone to Metal 4 Capture Capt: 0.7-7.7x 
2020 [9°*] BP Age 2 Full genomes Full gen: 3.9-18.5x 


“ Context: LSA — Later Stone Age; PN — Pastoral Neolithic; PIA — Pastoral Iron Age; and IA — Iron Age. 


had higher affinities to sub-Saharan Africans [11°°,12°°] 
(Figure 1b). The Paleolithic (15 kya) and Early Neolithic 
(7 kya) individuals lived before and during the most 
recent Green Sahara period (stretching from 12 kya to 
5 kya), and yet they have similar genetic compositions 
with similar affinities to sub-Saharan Africa, while mod- 
ern-day North Africans have very little sub-Saharan Afri- 
can contribution. Consequently the cycling of the Sahara 
through its wet and dry phases seems to have had an 
influence on amount of gene-flow between North and 
Sub-Saharan Africa, although the exact dynamics of those 
migrations needs to be further investigated, ideally 
through genome-wide aDNA studies. 


Sub-Saharan Africa 

Modern and ancient DNA studies for sub-Saharan Africa 
indicated that this part of the continent had two very 
different phases in its history (Figure 2) (Reviewed in 
Ref. [19°*]). During deep history, before the invention of 
farming practices, hunter-gather groups seemed to have 
been related in an ‘isolation-by-distance’ fashion where 
geographic location closely reflected genetic relatedness 
of groups [7,20]. Although long distance migrations by 
hunter-gatherers in the past cannot be ruled out, more 
research in this regard is needed and aDNA studies will 
be invaluable to test this. The early history of the African 
continent is in stark contrast with the large population 
movements, within relatively short time periods, that 
followed the invention of farming practices in Africa 


(Figure 2) [19°°,21]. The origins of food producing prac- 
tices in Africa are still unclear but it is believed that crop 
cultivation was developed independently, in at least three 
regions: the Sahara/Sahel (around 7 kya), the Ethiopian 
highlands (~7—4 kya), and Western Africa (~5—3 kya) 
[3,22]. From these centers, farming practices spread to the 
rest of Africa, with domesticated animals reaching the 
southern tip of Africa around 2 kya and crop farming 
around 1.8 kya [3]. Genomic investigations of present-day 
and past humans contributed to hypotheses regarding the 
spread of farming groups in Africa. 


Migrations of food producers in sub-Sahara 
Africa 

The first African aDNA nuclear genome to be sequenced 
was that of a 4.5 kya individual (from Mota, Ethiopia, 
Figure la) [6]. The comparison of this genome and 
present East African populations revealed a Eurasian 
back migration into East Africa. The Mota genome did 
not harbor any evidence of a back migration [6], while 
more recent populations had clear evidence of increased 
Eurasian admixture compared to the Mota genome [23]. 
The consequences of this migration can be seen on the 
PCA (Figure 1), where current day East African popula- 
tions (beige dots) and ancient East African individuals 
from farming contexts (beige diamonds) are intermediate 
between the Mota individual (yellow diamond - repre- 
senting non-admixed East African hunter-gatherers) and 
non-Africans (blue dots). However, subsequent genetic 
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Modern-day and ancient genetic structure in Africa. (a) Geographic locations of modern-day populations (colored dots) and ancient individuals 
(diamonds with border). (b-d): PCA’s rotated with Procrustes analysis to fit geography. Procrustes correlation factors between PCA space and 
geographic space are indicated on the figures (b) PC1 versus PC2. (c): PC1 versus PC3. (d) PC1 versus PC4. (e) Averaged per population 
clustering analysis, ADMIXTURE results at K=5. PCA and ADMIXTURE analysis were performed on overlapping SNP positions obtained from 
[8°°,20,24,25,49-54]. Positions with more than 0.15 missingness were discarded, after quality-control 182 977 LD-pruned SNPs were analyzed. 
Only aDNA samples with at least 15 000 of overlapping SNPs were considered in the analyses. 
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African population history before and after farming. (a) Map of major pre-farming population stratification across the African continent, showing 
non-Africans and North Africans (blue), East Africans (yellow), West Africans (gray), Central African rainforest hunter-gatherers (green), and 
Southern African Khoe-San hunter-gatherers (red). (b) Migration routes related to the expansion of herders and crop farmers during the Holocene. 
The gray arrows represent the Bantu expansion, blue arrows represent the Eurasian back-migrations, brown/blue arrow represents the southward 
migration of admixed East African-Eurasian pastoralists, and gray/brown arrow represents bidirectional migration along the Sahel belt. 


Abbreviation: kya — thousand years ago. 


studies showed that certain Northeast African groups, for 
example, for example Dinka and Nuer groups from Sudan 
[23] and the Sabue East African hunter-gatherers [24] 
(Figure le), until today had little to no Eurasian 
admixture. 


A study that sequenced 15 ancient Africans across East- 
ern and Southern Africa dating as far back as 8.1 kya [7], 
found that ancient herders carrying this East African- 
Eurasian ancestry, migrated all the way down into south- 
erm Africa introducing herding practices to the south 
(beige diamonds and dots in Eastern and Southern 
Africa, Figure 1b and yellow-blue component in 
S.Africa_LSA_Pastoralist_1200BP, Figure le). This 
confirmed previous findings based on modern-day auto- 
somal [25-27], Y-chromosome [28] and Lactase Persis- 
tence [29-31] variants, which indicate the presence of an 
East African component in Southern African herders 
(yellow-blue component in Southern African herding 
population ‘Nama’, Figure le). 


The introduction of farming in East Africa was further 
refined by a study of 41 individuals associated with Later 
Stone Age (LSA), Pastoral Neolithic (PN), and Iron Age 
(IA) contexts in current-day Kenya and Tanzania [8°*]. 
The study inferred two phases of admixture associated 
with the spread of pastoralists. The first event likely 
occurred ~6—5 kya in Northeastern Africa between 
groups carrying non-African genetic ancestry (related to 
groups from the Levant or North Africa) and local north- 
east African groups (related to present-day Dinka/Nuer). 
The second event occurred ~4 kya between this admixed 
group and Eastern African hunter-gatherers (related to 
Mota and Kenyan LSA individuals). The route of this 
non-African component into East Africa is as yet unclear, 
proposed to be either through the Nile-valley or the horn 
of Africa. Results also pointed to at least two chronologi- 
cally distinct southwards movements of herders into 
Eastern Africa, giving rise to the PN culture. Interest- 
ingly, genetic analyses failed to find any differentiation 
between two archacologically distinct PN cultures, 
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Elmenteitan and Savanna Pastoral Neolithic cultures 
(Figure le) [8°°]. This observation suggests that archae- 
ologically distinct cultures do not necessarily imply 
genetically distinct groups. The subsequent arrival of 
the Iron Age in Eastern Africa was similarly complex. 
Sudan related gene flow (Kenya_Pastoral_IA, Figure 1) 
closely followed by western African gene-flow related to 
present-day Bantu-speakers (Kenya_IA_Deloraine and 
Tanzania_Pemba_600BP), mark the onset of the Iron 
Age in the region and the introduction of crop farming 


[7;3""]. 


‘The Bantu-expansion began around ~5—3 kya in West- 
ern Africa (Figure 2b) and is one of the largest farmer 
expansion events globally [32-34]. The archaeological 
record for West Africa show that increased sedentism 
was followed by the spread of agricultural practices 
and, later on, the use of iron [3,33]. Today, in most of 
sub-Saharan Africa, Bantu languages are the majority 
languages. Earlier genetic studies have indicated that 
the current distribution of Bantu-speaking populations 
is largely a consequence of the movement of people 
(demic diffusion) rather than a diffusion of only language 
and farming practices [25,32,34,35]. This has been con- 
firmed by aDNA studies; ancient remains found in Iron 
Age contexts, in East [7,8°°] and Southern Africa [10] 
group genetically with current day West African groups 
(gray diamonds and dots, Figure 1). 


A study that sequenced the nuclear genomes of 7 ancient 
Southern African individuals, found that the three LSA 
individuals (2 kya — red diamonds in Southern Africa — 
Figure 1) were related to current day Khoe-San hunter- 
gatherers (red dots) and the four Iron Age individuals 
(300-500 ya — gray diamonds in Southern Africa — Figure 1) 
to current day West Africans (gray dots). This study con- 
firmed a large-scale population replacement in southern 
Africa, where LSA ancestors of Khoe-San hunter-gatherers 
were replaced by incoming Bantu-speaking farmers with 
West African ancestry. In South Africa, Bantu-speakers 
received substantial amounts of gene-flow from local 
Khoe-San hunter-gatherers (e.g. see Zulu and Sotho popu- 
lations, Figure le). In contrast, current-day populations 
from Malawi [7] and Mozambique [36°] show little to no 
admixture with hunter-gather groups that occupied the 
area before the Bantu-expansion, indicating that the migra- 
tion of Bantu-speakers was a complex process. Further 
aDNA studies and better coverage of modern-day sub- 
equatorial African populations are needed to further clarify 
the demographic dynamics and migration routes during the 
Bantu-expansion. 


Deep population history in sub-Sahara Africa 
Before the large-scale migrations of farming groups across 
the continent, hunter-gatherer groups seemed to have been 
related in clinal patterns with distance between groups 
determining relationships, with neighboring groups being 


more related to each other than groups whoare separated by 
large geographic distances. This isolation-by-distance pat- 
ternis still reflected in the relatedness between present day 
hunter-gatherer groups, illustrated by a study on Southern 
Africa Khoe-San hunter-gatherers [20]. Likewise, an 
aDNA study [7] found a genetic cline between hunter- 
gatherer populations of Eastern and Southern Africa, before 
farming (yellow to red cline on PCA — Figure 1). This cline 
might also have extended to West-Central African ancient 
HGs from Shum Laka (Cameroon) [9°°] related to current- 
day West-Central African rainforest hunter-gatherers 
(RHGs) (green diamonds and dots, Figure 1). The Shum 
Laka samples fall intermediate between Southern African 
hunter-gatherers and West African Niger-Congo speakers 
on PC 1, Figure 1b. A PCA that included only ancient and 
modern-day hunter-gatherers (Figure 3) show a Procrustes 
correlation factor with geography of 77% (when farmers are 
included the coefficient is reduced to 30-39 % - Figure 1). 


Although the genetic ancestries of ancient and modern 
hunter-gatherers, might suggest long-term clinal relation- 
ships between groups across the African landscape, data 
from more aDNA studies are needed to determine 
whether any large-scale migrations underlies this pattern. 
From European modern-day [37] and subsequent aDNA 
studies [38], we have learnt that isolation-by-distance 
patterns can mask several large-scale movements and 
replacements in the distant past, especially if no un- 
admixed present day groups are remaining. Thus the 
deep history of Africa remains to be clarified, but aDNA 
studies already started to contribute inferences. 


A 2 kya high-coverage LSA ancient genome from South- 
ern Africa [10], revealed that all modern Khoe-San 
groups, including the Jul|’hoansi group, have 9-22 % 
admixture from the Eurasian-East African group who 
introduced herding into Southern Africa. This can be 
seen on the PCA (Figure 1b) as a shift in modern-day 
Khoe-San groups (red dots) towards the East African side 
of the PCA (some dots are also shifted to the West African 
corner due to Bantu-speaker admixture). The Ju|’hoansi 
was used previously as a Khoe-San representative group 
with very little admixture from neighboring groups, to 
date the first divergence event between modern human 
populations (estimated at ~100 kya and ~200 kya, using 
the updated human mutation rate) [25,39,40]). When the 
non-admixed LSA individual was compared to other 
Africans the divergence time was pushed back to between 
260-350 kya [10], towards the genesis of the Middle 
Stone Age when humans became morphologically and 
behaviorally modern. The study found a first divergence 
between, Khoe-San and all other populations (mean of 
322 kya), followed by the split of RHGs (221 kya) fol- 
lowed by a West versus East Africans split (137 kya) [10]. 
While hierarchical bifurcating tree models provide a good 
estimation of general relatedness between groups, it is 
certainly a simplified representation of human history, 
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PCA including only sub-Saharan modern-day (colored dots) and ancient hunter-gatherers (diamonds with borders), attempting to represent pre- 
farming population structure. The Qatari population (light blue) was used to extract Eurasian admixture on PC1 (not shown). PC 2 and 3 were 
rotated to geography. The Procrustes correlation factor with geography is 77%, compared to 30-39% when farmers are included - Figure 1. 


and more complex models incorporating gene-flow, 
migration and deep population structure need to be 
considered in future research [41°]. 


Several studies have already reported evidence of deep 
coalescencing lineages in West Africa, suggesting possi- 
ble deep rooting admixture by a ‘ghost’ population — a 
population distantly related to modern humans that does 
not exist as a separate population today [7,10,42—-44]. A 
recent study [45] also suggested extensive Neanderthal 
admixture in African populations, adding additional deep 
structure to the African demographic picture. The study 
by Lipson e a/. [9°*] suggested radiations between four 
ancestral groups rather than a consecutive split model. 
This fourfold radiation involved lineages leading to; 
Khoe-San hunter-gatherers, Central African RHGs, East 
and West Africans, and a ‘ghost modern’ population. 
Additionally, a ‘ghost-archaic’ source was also added to 
the model and West African groups contained low 
amounts of ghost-archaic and larger amounts of ghost- 
modern admixture. There were however other models 
that also fitted their data. 


With more and more evidence from the paleoanthropo- 
logical and genetics fields, suggesting a possible multire- 
gional origin of modern humans on the African continent 
[19°°,41°,46,47], future studies on deep African history 
need more rigorous testing with simulations of realistic 
models [41°]. In addition, geographic and climatic model- 
ling need to be incorporated in these models [48], 
together with time-serial data obtained from aDNA stud- 
ies. Although this embodies a demanding task, some 
frameworks have already been laid [41°,48] and as more 
aDNA data becomes available together with genomic 
data from modern-day African populations, future 
research will further clarify the picture of our deep 
genetic history in Africa. aDNA research together with 
complimentary research from other fields, will continue to 
reveal more nuanced stories of human prehistory, and 
these stories might help us to understand the past, present 
and the future of our species 
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